A Sinusoidal Noise Model Based Speech Synthesis For Phoneme Transition

Authors

H. M. L. N. K Herath

J. V Wijayakulasooriya

Abstract

One well-known problem in speech synthesis is the occurrence of audible discontinuities at phoneme boundaries, which make synthetic speech sound unnatural. This paper presents a mathematical method, based on the sinusoidal noise model, for reconstructing the transition regions from one phoneme to another with low storage requirements. The speech parameters of the sinusoidal noise model were estimated and stored as polynomials, from which the transition waveform is reconstructed. According to the results, all transition regions considered in this experiment achieved higher correlation values with lower-order polynomials and a smaller capacity ratio. In addition, the same experiment was carried out while varying the number of FFT coefficients. As the number of FFT coefficients increased, both the capacity ratio and the correlation coefficient increased. The results indicate that a signal very close to the original can be generated with a small number of FFT coefficients.
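The idea of storing model parameters as polynomials can be illustrated with a minimal sketch. It is not the authors' implementation: the noise component is omitted, the parameter tracks are synthetic rather than estimated from speech, and the names `synthesize`, `fit_tracks`, `eval_tracks`, and `demo` are hypothetical. The storage ratio computed here (stored polynomial coefficients divided by waveform samples) is only an assumed stand-in for the paper's capacity ratio, whose exact definition is not given in the abstract.

```python
import numpy as np

def synthesize(amps, freqs, fs, hop):
    """Sum sinusoids from frame-rate amplitude/frequency tracks (shape K x n_frames)."""
    n_frames = amps.shape[1]
    frame_pos = np.arange(n_frames) * hop
    sample_pos = np.arange((n_frames - 1) * hop + 1)
    out = np.zeros(len(sample_pos))
    for a_k, f_k in zip(amps, freqs):
        a = np.interp(sample_pos, frame_pos, a_k)   # amplitude per sample
        f = np.interp(sample_pos, frame_pos, f_k)   # frequency per sample (Hz)
        phase = 2.0 * np.pi * np.cumsum(f) / fs     # integrate frequency to phase
        out += a * np.cos(phase)
    return out

def fit_tracks(tracks, order):
    """Fit one polynomial per track over normalized time [0, 1]."""
    t = np.linspace(0.0, 1.0, tracks.shape[1])
    return np.array([np.polyfit(t, row, order) for row in tracks])

def eval_tracks(coeffs, n_frames):
    """Evaluate the stored polynomials back onto the frame grid."""
    t = np.linspace(0.0, 1.0, n_frames)
    return np.array([np.polyval(c, t) for c in coeffs])

def demo(order=2, fs=8000, hop=80, n_frames=21):
    # Assumed toy transition: three harmonics with a gliding pitch and decaying amplitude.
    t = np.linspace(0.0, 1.0, n_frames)
    f0 = 120.0 + 30.0 * t
    freqs = np.outer([1.0, 2.0, 3.0], f0)
    amps = np.outer([1.0, 0.5, 0.25], np.exp(-t))
    original = synthesize(amps, freqs, fs, hop)

    # Store only polynomial coefficients, then rebuild the tracks and the waveform.
    amp_coeffs = fit_tracks(amps, order)
    freq_coeffs = fit_tracks(freqs, order)
    rebuilt = synthesize(eval_tracks(amp_coeffs, n_frames),
                         eval_tracks(freq_coeffs, n_frames), fs, hop)

    corr = np.corrcoef(original, rebuilt)[0, 1]
    # Assumed definition: stored numbers relative to raw waveform samples.
    ratio = (amp_coeffs.size + freq_coeffs.size) / original.size
    return corr, ratio

if __name__ == "__main__":
    corr, ratio = demo(order=2)
    print(f"correlation = {corr:.4f}, storage ratio = {ratio:.4f}")
```

Calling `demo` with different `order` values gives a rough feel for the trade-off the abstract describes between polynomial order, storage, and correlation with the original waveform.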


Similar resources

Polynomial quasi-harmonic models for speech analysis and synthesis

Harmonic plus noise models have been successfully applied to a broad range of speech processing applications, including, among others, low bit-rate speech coding, and speech restoration and transformation. In conventional methods, the frequencies, the relative phases and the amplitudes of the pitch-harmonic components are assumed to be piecewise constants over an analysis frame. This assumption...


Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation, which refers to the integration of sounds, is one of the major obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighboring phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...


Accurate estimation of sinusoidal parameters in an harmonic+noise model for speech synthesis

We present here a Harmonic+Noise Model (HNM) for speech synthesis. The noise part is represented by an autoregressive model whose output is pitch-synchronously modulated in energy. The harmonic part of the signal is represented by a sinusoidal model. This paper compares different methods for separating these two components. We then propose a method for the estimation of the sinusoidal parameters...


Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research shows that using a deep neural network (DNN) in speech recognition systems significantly improves their performance. DNN-based phoneme recognition systems have two phases: training and testing. Mos...


Parameterizing Speech Phonemes by Exponential Sinusoidal Model

The exponential sinusoidal model (ESM) for parameterizing speech signals is proposed in this paper. The main feature of the ESM is that the amplitude of each sinusoidal component is allowed to vary exponentially with time. A novel variable segmentation strategy is applied first to separate individual transients of a voiced phoneme, which can then be fitted with the ESM. The epoch of a transient...




Publication date: 2014
